Model Selection

Fine-tuning Optimization

# Fine-tuning Optimization

Mask2former Finetuned ER Mito LD5

An image segmentation model fine-tuned on the Dnq2025/Mask2former_Pretrain dataset, based on the facebook/mask2former-swin-base-IN21k-ade-semantic model

Image Segmentation

Lightblue Reranker 0.5 Cont Filt Gguf

A text ranking model fine-tuned based on Qwen2.5-0.5B-Instruct, suitable for information retrieval and relevance ranking tasks

Large Language Model

Belle Whisper Large V3 Turbo Zh

A Chinese speech recognition model fine-tuned based on whisper-large-v3-turbo, showing significant performance improvements in multiple Chinese speech recognition benchmarks

Speech Recognition

T5 Small Finetuned V2 Hausa To Chinese

A Hausa-to-Chinese translation model fine-tuned based on T5-small, achieving a BLEU score of 30.0183 on the evaluation set.

Machine Translation

Vit GPT2 Image Captioning Model

An image caption generation model based on the ViT-GPT2 architecture, capable of converting input images into descriptive text

Learn Hf Food Not Food Text Classifier Distilbert Base Uncased

This is a text classification model fine-tuned on DistilBERT-base-uncased for distinguishing between food and non-food text content.

Text Classification

Xfinder Llama38it

xFinder-llama38it is a fine-tuned key answer extraction model based on Llama3-8B-Instruct, designed to improve the accuracy and robustness of key answer extraction from large language model outputs.

Large Language Model

Transformers English

Deepfake Audio Detection

A speech processing model further fine-tuned based on wav2vec2-base-finetuned, achieving 98.82% accuracy on the evaluation set

Speech Recognition

Deepfake Audio Detection

A fine-tuned speech processing model based on wav2vec2-base-finetuned, achieving 98.82% accuracy on the evaluation set

Speech Recognition

Paligemma 3b Pt 448

PaliGemma is a lightweight and versatile vision-language model built on the SigLIP vision model and Gemma language model, supporting multilingual image-text interaction tasks.

Tinyllama Essay Scorer

A fine-tuned essay scoring model based on TinyLlama-1.1B

Large Language Model

T5 Small Finetuned Nl2sql

A T5-small fine-tuned NL2SQL model for converting natural language to SQL queries

Large Language Model

Belle Distilwhisper Large V2 Zh

A Chinese speech recognition model fine-tuned based on distilwhisper-large-v2, with a speed 5.8 times faster than whisper-large-v2 and 51% fewer parameters

Speech Recognition

Layout Qa Hparam Tuning

A document QA model fine-tuned based on microsoft/layoutlmv2-base-uncased, suitable for document layout understanding and QA tasks

Question Answering System

Whisper Small Turkish Tr Best

Turkish speech recognition model fine-tuned based on OpenAI Whisper-small, with a word error rate of 26.34%

Speech Recognition

Fine-tuned image-to-text model based on microsoft/git-base

Transformers Other

Distilhubert Finetuned Gtzan

This model is a fine-tuned version of DistilHuBERT on the GTZAN music classification dataset, primarily used for music genre classification tasks.

Audio Classification

Opus Mt Ko En Finetuned

A Korean-English translation model fine-tuned based on Helsinki-NLP's opus-mt-ko-en model

Machine Translation

Saved Model Git Base

A vision-language model fine-tuned on image folder datasets based on microsoft/git-base, primarily used for image caption generation tasks

Transformers Other

Segformer B0 Finetuned Segments Test

An image segmentation model fine-tuned on the bilal01/stamp-verification-test dataset based on nvidia/mit-b0

Image Segmentation

Swin Tiny Patch4 Window7 224 Isl Finetuned

A vision model fine-tuned based on microsoft/swin-tiny-patch4-window7-224, achieving 100% accuracy on the evaluation set

Image Classification

Detr Resnet 50 Finetuned Cppe5

Object detection model fine-tuned on the cppe-5 dataset based on facebook/detr-resnet-50

Object Detection

Videomae Base Finetuned

A video understanding model fine-tuned on an unknown dataset based on the VideoMAE base model, achieving 86.41% accuracy on the evaluation set

Video Processing

Xtremedistil L6 H384 Uncased Finetuned Squad

This model is a fine-tuned version of microsoft/xtremedistil-l6-h384-uncased on the SQuAD dataset, primarily used for question answering tasks.

Question Answering System

Swin Base Finetuned Cifar100

This model is an image classification model fine-tuned on the CIFAR-100 dataset based on the Swin Transformer architecture, achieving an accuracy of 92.01%.

Image Classification

Whisper Medium Jp

Japanese speech recognition model fine-tuned on the common_voice_11_0 dataset based on openai/whisper-medium

Speech Recognition

Transformers Japanese

Xlm Roberta Base Finetuned Urdu

Urdu sentiment classification model based on xlm-roberta-base architecture, capable of binary sentiment classification for Urdu sentences

Text Classification

Transformers Other

Segformer B0 Finetuned Segments Water 2

An image segmentation model fine-tuned on the imadd/water_dataset dataset based on nvidia/mit-b0, designed for water segmentation tasks

Image Segmentation

Resnet 50 Ucsat

An image classification model fine-tuned based on microsoft/resnet-50, demonstrating medium accuracy on an unknown dataset

Image Classification

Distilbert Base Uncased Becas 4

A text classification model fine-tuned on the becasv2 dataset based on distilbert-base-uncased

Large Language Model

Distilbert Base Uncased Finetuned Squad

A question-answering model based on DistilBERT, fine-tuned on the SQuAD dataset for extractive question answering tasks.

Question Answering System

Deberta Base Finetuned Aqa

A QA model fine-tuned on the adversarial_qa dataset based on microsoft/deberta-base

Question Answering System

Convnext Tiny Finetuned Beans

This model is an image classification model fine-tuned on the beans dataset based on the ConvNeXt-Tiny architecture, achieving an accuracy of 96.09%.

Image Classification

Xtremedistil L12 H384 Uncased Finetuned Wikitext103

This model is a fine-tuned version of microsoft/xtremedistil-l12-h384-uncased on the wikitext dataset, primarily used for text generation tasks.

Large Language Model

Wav2vec2 Base Cv

A speech recognition model fine-tuned on the common_voice dataset based on facebook/wav2vec2-base

Speech Recognition

Wav2vec2 Large Xls R 300m Pt Colab

A speech recognition model fine-tuned on the common_voice dataset based on facebook/wav2vec2-xls-r-300m

Speech Recognition

Distilhubert Ft Common Language

This model is a fine-tuned audio classification model based on distilhubert trained on a common language dataset, primarily used for language recognition tasks.

Audio Classification

Distilroberta Base Model Transcript

A text processing model fine-tuned based on the distilroberta-base model, suitable for general NLP tasks

Large Language Model

Sagemaker Distilbert Emotion

A text sentiment classification model based on DistilBERT, fine-tuned on the emotion dataset with an accuracy of 92.9%

Text Classification

NER RUBERT Per Loc Org

A lightweight Russian named entity recognition model based on BERT architecture, supporting the identification of three types of entities: person, location, and organization.

Sequence Labeling

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase